Associating microbiome composition with environmental covariates using generalized UniFrac distances
نویسندگان
چکیده
MOTIVATION The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based statistical tests have been applied to test the association of microbiome composition with environmental or biological covariates. The unweighted and weighted UniFrac distances are the most widely used distance measures. However, these two measures assign too much weight either to rare lineages or to most abundant lineages, which can lead to loss of power when the important composition change occurs in moderately abundant lineages. RESULTS We develop generalized UniFrac distances that extend the weighted and unweighted UniFrac distances for detecting a much wider range of biologically relevant changes. We evaluate the use of generalized UniFrac distances in associating microbiome composition with environmental covariates using extensive Monte Carlo simulations. Our results show that tests using the unweighted and weighted UniFrac distances are less powerful in detecting abundance change in moderately abundant lineages. In contrast, the generalized UniFrac distance is most powerful in detecting such changes, yet it retains nearly all its power for detecting rare and highly abundant lineages. The generalized UniFrac distance also has an overall better power than the joint use of unweighted/weighted UniFrac distances. Application to two real microbiome datasets has demonstrated gains in power in testing the associations between human microbiome and diet intakes and habitual smoking. AVAILABILITY http://cran.r-project.org/web/packages/GUniFrac
منابع مشابه
PERMANOVA-S: association test for microbial community composition that accommodates confounders and multiple distances
MOTIVATION Recent advances in sequencing technology have made it possible to obtain high-throughput data on the composition of microbial communities and to study the effects of dysbiosis on the human host. Analysis of pairwise intersample distances quantifies the association between the microbiome diversity and covariates of interest (e.g. environmental factors, clinical outcomes, treatment gro...
متن کاملComparisons of Distance Methods for Combining Covariates and Abundances in Microbiome Studies
This article compares different methods for combining abundance data, phylogenetic trees and clinical covariates in a nonparametric setting. In particular we study the output from the principal coordinates analysis on UNIFRAC and WEIGHTED UNIFRAC distances and the output from a double principal coordinate analyses DPCOA using distances computed on the phylogenetic tree. We also present power co...
متن کاملSex differences and hormonal effects on gut microbiota composition in mice
We previously reported quantitation of gut microbiota in a panel of 89 different inbred strains of mice, and we now examine the question of sex differences in microbiota composition. When the total population of 689 mice was examined together, several taxa exhibited significant differences in abundance between sexes but a larger number of differences were observed at the single strain level, su...
متن کاملA Walnut-Enriched Diet Affects Gut Microbiome in Healthy Caucasian Subjects: A Randomized, Controlled Trial
Regular walnut consumption is associated with better health. We have previously shown that eight weeks of walnut consumption (43 g/day) significantly improves lipids in healthy subjects. In the same study, gut microbiome was evaluated. We included 194 healthy subjects (134 females, 63 ± 7 years, BMI 25.1 ± 4.0 kg/m²) in a randomized, controlled, prospective, cross-over study. Following a nut-fr...
متن کاملVariable Selection for Sparse Dirichlet-multinomial Regression with an Application to Microbiome Data Analysis.
With the development of next generation sequencing technology, researchers have now been able to study the microbiome composition using direct sequencing, whose output are bacterial taxa counts for each microbiome sample. One goal of microbiome study is to associate the microbiome composition with environmental covariates. We propose to model the taxa counts using a Dirichlet-multinomial (DM) r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 28 شماره
صفحات -
تاریخ انتشار 2012